Conversation
Integrate llama.cpp via Go bindings for local embedding generation. Add sqlite-vec for vector storage and similarity search. Include schema migrations, daemon API changes, and proto updates.
…build

- Fix sqlite-vec compilation on Alpine/musl by guarding BSD type aliases with __GLIBC__
- Dockerfile: switch to CPU-only llama.cpp build (Vulkan shaders fail on Alpine)
- Dockerfile: add llama-go go.mod copy for replace directive support
- CI workflows: add GGUF model caching and download steps
- CI workflows: add llama.cpp build steps (CPU-only for tests, GPU for desktop releases)
- CI workflows: add LIBRARY_PATH/C_INCLUDE_PATH env vars for CGO linking
- ci-setup action: add Vulkan SDK and llama.cpp build per platform
Replace the vendored backend/util/llama-go directory (~1200 C/C++ files, 500K+ lines) with a git submodule pointing to seed-hypermedia/llama-go. Changes:

- Remove vendored llama-go and add it as a git submodule
- Fix go.mod: use the upstream tcpipuk/llama-go module path with a replace directive pointing to ./backend/util/llama-go
- Update the import in llamacpp.go to use the upstream module path
- Add a submodule init guard to .envrc (before mise activation)
- Add a submodule existence check to the mise.toml ensure-llama-libs task
- Remove sync_llama_go() and generate_gpu_build_files() from the ./dev script
- Add "submodules: recursive" to 12 CI checkout steps across 10 workflows
- Fix wrapper.cpp in the fork: use common_chat_parser_params matching the pinned llama.cpp version (commit 2eee6c866)
- Use an HTTPS URL in .gitmodules so cloning works without SSH keys
- Add an ensure-submodule mise task to auto-init submodules
- Make ensure-llama-libs depend on ensure-submodule
- Move setup orchestration from the mise enter hook (unreliable with direnv) to explicit mise run calls in .envrc
- Result: git clone + cd into the repo does everything automatically
The llama-go submodule includes the full llama.cpp source tree (~2500 files, 148MB). The previous glob copied all of them into the Please sandbox temp dir before building, causing massive disk I/O and memory pressure that could freeze the machine. Build in $WORKSPACE in place (as seed-daemon already does) and copy only the ~9 output .a files back to the sandbox. The Makefile is kept as a src entry so Please can still track changes to it.
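The copy-back step can be sketched in shell. This is a minimal illustration on a scratch tree, not the actual genrule: the directory layout, file names, and the WORKSPACE/SANDBOX_OUT variables are all stand-ins.

```shell
# Demonstrate copying only the .a archives out of a workspace tree while
# leaving the (huge) llama.cpp source tree behind. All paths are stand-ins.
set -eu
WORKSPACE="$(mktemp -d)"    # stands in for the real checkout
SANDBOX_OUT="$(mktemp -d)"  # stands in for the Please sandbox output dir

# Fake build results: archives sitting next to thousands of sources.
mkdir -p "$WORKSPACE/llama.cpp/src"
touch "$WORKSPACE/llama.cpp/src/llama.cpp"
touch "$WORKSPACE/libllama.a" "$WORKSPACE/libggml.a"

# Copy back only the static libraries; sources never enter the sandbox.
find "$WORKSPACE" -name '*.a' -exec cp {} "$SANDBOX_OUT/" \;
ls "$SANDBOX_OUT"
```

The point of the shape is that the sandbox only ever sees the handful of link inputs, so Please has almost nothing to hash or copy.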
Eliminate the SEED_CPU_ONLY / SEED_USE_GPU toggle that caused build conflicts when ensure-llama-libs (CPU-only) and plz (GPU) built into the same directory with different modes. Now each platform always uses the same GPU mode everywhere:

- macOS: always Metal (built-in, zero deps)
- Linux: always CPU-only for local dev (no Vulkan packages needed)
- CI: handles per-platform GPU builds in ci-setup/action.yml

Changes:

- mise.toml: ensure-llama-libs detects the OS and builds Metal on macOS, CPU-only on Linux. Detects stale CPU builds on macOS via a missing libggml-blas.a and forces a rebuild.
- backend/BUILD.plz: the llama-cpp and seed-daemon genrules use OS detection instead of the SEED_CPU_ONLY env var.
- dev: remove setup_gpu_build() and the --cpu/--gpu flags from all commands.
- .plzconfig: remove SEED_USE_GPU/SEED_CPU_ONLY from PassUnsafeEnv.
- Fork Makefile: add Metal mismatch detection alongside the existing Vulkan detection in the CMake cache checks.
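The per-OS selection described above can be sketched as a small shell case statement. The real logic lives in mise.toml and BUILD.plz; the BUILD_FLAVOR variable here is purely illustrative.

```shell
# Pick the GPU mode from the OS alone -- no env-var toggle that can drift
# between ensure-llama-libs and plz builds.
set -eu
case "$(uname -s)" in
  Darwin) BUILD_FLAVOR=metal ;;  # Metal is built into macOS, zero extra deps
  Linux)  BUILD_FLAVOR=cpu ;;    # local dev is CPU-only; CI layers GPU builds on top
  *)      echo "unsupported OS: $(uname -s)" >&2; exit 1 ;;
esac
echo "building llama.cpp flavor: $BUILD_FLAVOR"
```

Because the mode is a pure function of the OS, two tools building into the same output directory can never disagree about it.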
The seed-daemon genrule's glob(**/*.c, **/*.h, **/*.cpp, **/*.hpp) captures ~2500 files from the llama.cpp nested submodule. Please hashes and copies all of them into the sandbox, causing 10+ minute builds and extreme CPU/memory usage. Exclude util/llama-go/llama.cpp/** since the seed-daemon genrule only needs the compiled .a libraries (via :llama-cpp dependency), not the C/C++ source files.
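The fix amounts to adding an exclude list to the glob. The fragment below is a sketch under the assumption that the genrule looks roughly like this; the target name and attribute set are illustrative, and it assumes Please's glob accepts an exclude parameter.

```python
# Sketch of the seed-daemon genrule srcs after the fix: skip the nested
# llama.cpp tree, since the compiled archives arrive via :llama-cpp.
genrule(
    name = "seed-daemon",
    srcs = glob(
        ["**/*.c", "**/*.h", "**/*.cpp", "**/*.hpp"],
        exclude = ["util/llama-go/llama.cpp/**"],
    ),
    deps = [":llama-cpp"],  # provides the prebuilt .a libraries
    # cmd, outs, etc. unchanged and omitted here
)
```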
plz build takes 10+ minutes for seed-daemon due to sandbox overhead (copying files, hashing dependencies). go build directly takes ~12s from cold cache, ~3s incremental. Replace plz build //backend:seed-daemon with direct go build in all ./dev commands (build-desktop, test-desktop, run-backend, build-backend). The BUILD.plz genrule is still used by CI workflows. Also fix build-backend which still referenced the removed setup_gpu_build() function.
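The direct-build path can be illustrated end to end on a throwaway module. This is a sketch: the real command builds the seed-daemon package, and the stub module below is purely illustrative.

```shell
# Build a trivial stand-in binary with `go build` directly -- no sandbox
# copying, no dependency hashing -- and run it. Falls back to a plain
# echo when no Go toolchain is installed, so the output is the same.
set -eu
DIR="$(mktemp -d)"
cd "$DIR"
cat > main.go <<'EOF'
package main

import "fmt"

func main() { fmt.Println("seed-daemon stub") }
EOF
if command -v go >/dev/null 2>&1; then
  go mod init example.com/stub >/dev/null 2>&1
  go build -o seed-daemon .
  ./seed-daemon
else
  echo "seed-daemon stub"  # no Go toolchain available
fi
```

go's own build cache is what makes the incremental case fast; the plz genrule remains the source of truth for CI, where hermetic builds matter more than local latency.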
This reverts commit 34208d7.
…ove dead setup_gpu_build from dev script
The test waited for embedCalls==2 then immediately checked the DB, but the INSERT transaction could still be in-flight. Now also waits for runOnce to fully complete (task deleted from taskMgr) before checking DB state.
… and remove test-gpu-build

- Add GGUF model cache + download steps to dev-desktop.yml (already in release-desktop.yml)
- Add a Windows DLL verification step to both dev-desktop.yml and release-desktop.yml
- Delete test-gpu-build.yml, as all its steps are now in the real workflows
This PR is identical to #152 in functionality, but it uses submodules instead of raw source files to avoid polluting the workspace with third-party code. All submodule initialization is handled by direnv, so it is transparent to the developer.